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mechanism proposed is "selective stickiness", but this does'nt sound any ... 
discuss.foresight.org/critmail/ sci_nano.88-94/0564.html - 6k - Cached - Similar pages 

fpsi The Performance Potential of Data Dependence Speculation & ... 

File Format: Adobe PostScript - View as Text 

... When the correct address becomes available and a misprediction. 1 Note that the 
load speculation mechanism uses a subset of the mechanisms presented in [5]. ... 
www2.cs.ucy.ac,cy/~yanos/papers/micro29.ps - Similar pages 

[pdf] Observation of the CIV effect in interstellar clouds: a ... 
File Format: PDF/Adobe Acrobat - View as HTML 

... 6, DECEMBER 2000 Observation of the CIV Effect in Interstellar Clouds: A Speculation 
on the Physical Mechanism for Their Existence Anthony L. Peratt, Fellow ... 
public.lanl.gov/alp/plasma/Perattpdf/ PerattVerschuurDec2000.pdf - Similar pages 

Mechanisms of Action of Conjugated Linoleic Acid: Evidence and ... 

... Mechanisms of Action of Conjugated Linoleic Acid: Evidence and Speculation. ... activities 

for CLA that beg the question of underlying biochemical mechanism. ... 



h g gee e ch ec ech he e e 



Google Search: speculation med^ism 



Page 2 of 2 



www.blackwell-synergy.com/links/doi/1 0. 1 046/ j. 1 525-1 373.2000.22302. x/abs/ - Similar pages 
The Mechanism of Plate Tectonics 

... Introduction; Speculation: There appears to be no agreed upon explanation 
of the mechanism that drives plate tectonics. (The notions ... 
webspinners.com/dlblanc/tectonic/mechansm.html - 24k - Cached - Similar pages 



< Gooooooooooo-ogle ► 

Result Page: Previous 1 2 3 4 5 6 7 8 9 101112 Next 



speculation mechanism 



Search within results 



Google Home - Advertise with Us - Business Solutions - Services & Tools - Jobs, Press, & Help 

©2003 Google 



h g gee e ch ec ech he e e 



0 

Integrated Predicated and SpecuL^e Execution in the IMPACT EPIC ArAkecture - Au... Page 1 of 2 



View or download: 

Integrated Predicat d and Speculative Execution uiuc.edu7pub/IMPACT/c...isca98epic.ps 
in the IMPACT EPIC Architecture (1998) (Make uiuc.edu/IMPACT/ftp/c... isca98epic.ps 

Corrections) (24 citations) uiuc.edu/ece412/pape.. .isca98epic.pdf 

David I. August, Daniel A. Connors, Scott A Mahlke, John Cached: PS^gz PS PDF DjVu Image Update He!e 
W. Sias, Kevin M. Crozier, Ben-Chung Cheng, Patrick R. 

Eaton, Qudus B. Olaniran, Wen-mei W. Hwu From: uiuc.edu/IMPACT/pap.. .fconference (more) 

Proceedings of the 25th annual international symposium From: uiuc.edu/lmpact/people/.. .bcchenq 

on Computer architecture Homepages: DAug ust D.Connors 

S. Mahlke J. Sias 

CfteSfeer Home/Search Context Related K.Crozier B.Chenq El 131 Bl 

UICJCCI P.Eaton {21 Q.OIaniran 



W.Hwu HPSearch (Update Links) 

(Enter summary) Rate this article: 1 2 3 4 5 (best) 

Abstract: Explicitly Parallel Instruction Computing (EPIC) architectures Comment on this article 

require the compiler to express program instruction level parallelism directly to the hardware. EPIC techniques 
which enable the compiler to represent control speculation, data dependence speculation, and predication have 
individually been shown to be very effective. However, these techniques have not been studied in combination 
with each other. This paper presents the IMPACT EPIC Architecture to address the issues involved in... (Update) 
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with the question, Is general percolation really optimal A study combining sentinel scheduling with 
predicated execution for C programs [2] reported that for their benchmarks, 31 of cache misses and 13 
TLB misses and page faults were spurious and thus could potentially be 
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1 Concert-efficient runtime support for concurrent object-oriented 
U programming languages on stock hardware 

V. Karamcheti , A. Chien 

Proceedings of the 1993 ACM/IEEE conference on Supercomputing December 
1993 
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2 MaJIC: compiling MATLAB for speed and responsiveness 85% 
Eft George Almasi , David Padua 

L - ' ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 2002 Conference on 
Programming language design and implementation May 2002 
Volume 37 Issue 5 

This paper presents and evaluates techniques to improve the execution performance of 
MATLAB. Previous efforts concentrated on source to source translation and batch 
compilation; MaJIC provides an interactive frontend that looks like MATLAB and 
compiles/optimizes code behind the scenes in real time, employing a combination of 
just-in-time and speculative ahead-of-time compilation. Performance results show that 
the proper mixture of these two techniques can yield near- zero response time as ... 



3 Performance monitoring: TEST: a tracer for extracting speculative 80% 
2) threads 

Michael Chen , Kunle Olukotun 

Proceedings of the international symposium on Code generation and 
optimization: feedback-directed and runtime optimization March 2003 
Thread-level speculation (TLS) allows sequential programs to be arbitrarily 
decomposed into threads that can be safely executed in parallel. A key challenge for 
TLS processors is choosing thread decompositions that speedup the program. Current 
techniques for identifying decompositions have practical limitations in real systems. 
Traditional parallelizing compilers do not work effectively on most integer programs, 
and software profiling slows down program execution too much for real-time analysis. 
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4 A compiler framework for speculative analysis and optimizations 80% 

Jin Lin , Tong Chen , Wei-Chung Hsu , Pen-Chung Yew , Roy Dz-Ching Ju , Tin-Fook Ngai , 
^ Sun Chan 

ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 2003 conference on 
Programming language design and implementation May 2003 
Volume 38 Issue 5 

Speculative execution, such as control speculation and data speculation, is an effective 
way to improve program performance. Using edge/path profile information or simple 
heuristic rules, existing compiler frameworks can adequately incorporate and exploit 
control speculation. However, very little has been done so far to allow existing 
compiler frameworks to incorporate and exploit data speculation effectively in various 
program transformations beyond instruction scheduling. This paper proposes a ... 



5 Using dataflow analysis techniques to reduce ownership overhead in 80% 
cache coherence protocols 

Jonas Skeppstedt , Per Stenstrom 

ACM Transactions on Programming Languages and Systems (TOPLAS) November 
1996 

Volume 18 Issue 6 

In this article, we explore the potential of classical dataflow analysis techniques in 
removing overhead in write-invalidate cache coherence protocols for shared-memory 
multiprocessors. We construct the compiler algorithms with varying degree of 
sophistication that detect loads followed by stores to the same address. Such loads are 
marked and constitute a hint to the cache to obtain an exclusive copy of the block so 
that the subsequent store does not introduce access penalties. The simplest ... 



6 The Jrpm system for dynamically parallelizing Java programs 77% 

f^j Michael K. Chen , Kunle Olukotun 

ACM SIGARCH Computer Architecture News , Proceedings of the 30th annual 
international symposium on Computer architecture May 2003 
Volume 31 Issue 2 

We describe the Java runtime parallelizing machine (Jrpm), a complete system for 
parallelizing sequential programs automatically. Jrpm is based on a chip 
multiprocessor (CMP) with thread-level speculation (TLS) support. CMPs have low 
sharing and communication costs relative to traditional multiprocessors, and thread- 
level speculation (TLS) simplifies program parallelization by allowing us to parallelize 
optimistically without violating correct sequential program behavior. Using a Java 
virtual ma ... 



7 Cool-Cache: A compiler-enabled energy efficient data caching 77% 
Q) framework for embedded/multimedia processors 

Osman S. Unsal , Raksit Ashok , Israel Koren , C. Mani Krishna , Csaba Andras Moritz 
ACM Transactions on Embedded Computing Systems (TECS) August 2003 
Volume 2 Issue 3 

The unique characteristics of multimedia/embedded applications dictate media- 
sensitive architectural and compiler approaches to reduce the power consumption of 
the data cache. Our goal is exploring energy savings for embedded/multimedia 
workloads without sacrificing performance. Here, we present two complementary 
media-sensitive energy-saving techniques that leverage static information. While our 
first technique is applicable to existing architectures, in our second technique we adopt 
a more rad ... 
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8 Speculation: Compiler support for speculative multithreading 77% 
architecture with probabilistic points-to analysis 

Peng-Sheng Chen , Ming-Yu Hung , Yuan-Shin Hwang , Roy Dz-Ching Ju , Jenq Kuen Lee 
Proceedings of the ninth ACM SZGPLAN symposium n Principles and practice of 
parallel pr gramming June 2003 

Speculative multithreading (SpMT) architecture can exploit thread-level parallelism 
that cannot be identified statically. Speedup can be obtained by speculatively 
executing threads in parallel that are extracted from a sequential program. However, 
performance degradation might happen if the threads are highly dependent, since a 
recovery mechanism will be activated when a speculative thread executes incorrectly 
and such a recovery action usually incurs a very high penalty. Therefore, it is essent ... 



9 How to solve the current memory access and data transfer bottlenecks: 77% 
at the processor architecture or at the compiler level 

Francky Catthoor , Nikil D. Dutt , Christoforos E. Kozyrakis 

Proceedings of the conference on Design, automation and test in Europe January 
2000 

r 10 Integrated predicated and speculative execution in the IMPACT EPIC 77% 
* 12 architecture 

David I. August , Daniel A. Connors , Scott A. Mahlke , John W. Sias , Kevin M. Crozier , 

Ben-Chung Cheng , Patrick R. Eaton , Qudus B. Olaniran , Wen-mei W. Hwu 

ACM SIGARCH Computer Architecture News , Proceedings of the 25th annual 

international symposium on Computer architecture April 1998 

Volume 26 Issue 3 

Explicitly Parallel Instruction Computing (EPIC) architectures require the compiler to 
express program instruction level parallelism directly to the hardware. EPIC techniques 
which enable the compiler to represent control speculation, data dependence 
speculation, and predication have individually been shown to be very effective. 
However, these techniques have not been studied in combination with each other. This 
paper presents the IMPACT EPIC Architecture to address the issues involved in 
design ... 

11 Evaluation of scheduling techniques on a SPARC-based VLIW testbed 77% 

Seongbae Park , SangMin Shim , Soo-Mook Moon 

Proceedings of the 30th annual ACM/IEEE international symposium on 

Microarchitecture December 1997 

The performance of Very Long Instruction Word (VLIW) microprocessors depends on 
the close cooperation between the compiler and the architecture. This paper evaluates 
a set of important compilation techniques and related architectural features for VLIW 
machines. The evaluation is performed on a SPARC-based VLIW testbed where gcc- 
generated optimized SPARC code is scheduled into high-performance VLIW code. As a 
base scheduling compiler, we experiment with three core scheduling techniques 
including ... 

12 Simple compiler algorithms to reduce ownership overhead in cache 77% 
@) coherence protocols 

Jonas Skeppstedt , Per Stenstrom 

Proceedings of the sixth internati nal conference on Architectural support f r 
programming languages and operating systems November 1994 
Volume 29 , 28 Issue 11,5 

We study in this paper the design and efficiency of compiler algorithms that remove 
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ownership overhead in shared-memory multiprocessors with write-invalidate protocols. 
These algorithms detect loads followed by stores to the same address. Such loads are 
marked and constitute a hint to the cache to obtain an exclusive copy of the block. We 
consider three algorithms where the first one focuses on load-store sequences within 
each basic block of code and the other two analyse the existence of I ... 
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